A Multi-Scale Generalization of the HoG and HMAX Image Descriptors for Object Detection
نویسندگان
چکیده
Recently, several powerful image features have been proposed which can be described as spatial histograms of oriented energy. For instance, the HoG [5], HMAX C1 [14], SIFT [13], and Shape Context feature [4] all represent an input image using with a discrete set of bins which accumulate evidence for oriented structures over a spatial region and a range of orientations. In this work, we generalize these techniques to allow for a foveated input image, rather than a rectilinear raster. It will be shown that improved object detection accuracy can be achieved via inputting a spectrum of image measurements, from sharp, fine-scale image sampling within a small spatial region within the target to coarse-scale sampling of a wide field of view around the target. Several alternative feature generation algorithms are proposed and tested which suitably make use of foveated image inputs. In the experiments we show that features generated from the foveated input format produce detectors of greater accuracy, as measured for four object types from commonly available data-sets. Finally, a flexible algorithm for generating features is described and tested which is independent of input topology and uses ICA to learn appropriate filters.
منابع مشابه
Performance evaluation of block-based copy- move image forgery detection algorithms
Copy-move forgery is a particular type of distortion where a part or portions of one image is/are copied to other parts of the same image. This type of manipulation is done to hide a particular part of the image or to copy one or more objects into the same image. There are several methods for detecting copy-move forgery, including block-based and key point-based methods. In this paper, a method...
متن کاملVisual Tracking using Learning Histogram of Oriented Gradients by SVM on Mobile Robot
The intelligence of a mobile robot is highly dependent on its vision. The main objective of an intelligent mobile robot is in its ability to the online image processing, object detection, and especially visual tracking which is a complex task in stochastic environments. Tracking algorithms suffer from sequence challenges such as illumination variation, occlusion, and background clutter, so an a...
متن کاملA Comparative Study of Local Descriptors for Object Category Recognition: SIFT vs HMAX
In this paper we evaluate the performance of the two most successful state-of-the-art descriptors, applied to the task of visual object detection and localization in images. In the first experiment we use these descriptors, combined with binary classifiers, to test the presence/absence of object in a target image. In the second experiment, we try to locate faces in images, by using a structural...
متن کاملEfficient Scale and Rotation Invariant Object Detection Based on HOGs and Evolutionary Optimization Techniques
Object detection and localization in an image can be achieved by representing an object as a Histogram of Oriented Gradients (HOG). HOGs have proven to be robust object descriptors. However, to achieve accurate object localization, one must take a sliding window approach and evaluate the similarity of the descriptor over all possible windows in an image. In case that search should also be scale...
متن کامل’Histograms of Oriented Gradients for Human Detection’ versus ’Fast Human Detection Using a Cascade of Histograms of Oriented Gradients’
Dalal and Triggs [1] studied the question of feature sets for robust visual object recognition. They first considered existing edge and gradient based descriptors and then they showed experimentally that grids of Histograms of Oriented Gradients (HoG) descriptors significantly outperform existing feature sets for human detection. After this they studied the influence of each stage of the comput...
متن کامل